Approximate Kalman Filters for Embedding Author-Word Co-occurrence Data over Time
نویسندگان
چکیده
We address the problem of embedding entities into Euclidean space over time based on co-occurrence data. We extend the CODE model of Globerson et al. (2004) to a dynamic setting. This leads to a non-standard factored state space model with real-valued hidden parent nodes and discrete observation nodes. We investigate the use of variational approximations applied to the observation model that allow us to formulate the entire dynamic model as a Kalman filter. Applying this model to temporal co-occurrence data yields posterior distributions of entity coordinates in Euclidean space that are updated over time. Initial results on per-year co-occurrences of authors and words in the NIPS corpus and on synthetic data, including videos of dynamic embeddings, seem to indicate that the model results in embeddings of co-occurrence data that are meaningful both temporally and contextually.
منابع مشابه
A Latent Space Approach to Dynamic Embedding of Co-occurrence Data
We consider dynamic co-occurrence data, such as author-word links in papers published in successive years of the same conference. For static co-occurrence data, researchers often seek an embedding of the entities (authors and words) into a lowdimensional Euclidean space. We generalize a recent static co-occurrence model, the CODE model of Globerson et al. (2004), to the dynamic setting: we seek...
متن کاملLatent Topic Embedding
Topic modeling and word embedding are two important techniques for deriving latent semantics from data. General-purpose topic models typically work in coarse granularity by capturing word co-occurrence at the document/sentence level. In contrast, word embedding models usually work in fine granularity by modeling word co-occurrence within small sliding windows. With the aim of deriving latent se...
متن کاملImage Steganalysis Based on Co-Occurrences of Integer Wavelet Coefficients
We present a steganalysis scheme for LSB matching steganography based on feature vectors extracted from integer wavelet transform (IWT). In integer wavelet decomposition of an image, the coefficients will be integer, so we can calculate co-occurrence matrix of them without rounding the coefficients. Before calculation of co-occurrence matrices, we clip some of the most significant bitplanes of ...
متن کاملIdentification of Geologic Fault Network Geometry by Using a Grid-Based Ensemble Kalman Filter
Discrete geologic features such as faults and highly permeable embedded channels can significantly affect subsurface flow and transport characteristics. Therefore, they must be properly identified, parameterized, and represented in subsurface simulation models. In this work, we use an improved ensemble Kalman filter (EnKF) for history-matching fault network geometry from production data. EnKF i...
متن کاملIMPLEMENTATION OF EXTENDED KALMAN FILTER TO REDUCE NON CYCLO-STATIONARY NOISE IN AERIAL GAMMA RAY SURVEY
Gamma-ray detection has an important role in the enhancement the nuclear safety and provides a proper environment for applications of nuclear radiation. To reduce the risk of exposure, aerial gamma survey is commonly used as an advantage of the distance between the detection system and the radiation sources. One of the most important issues in aerial gamma survey is the detection noise. Various...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006